Zhao Song

T2VTextBench: A Human Evaluation Benchmark for Textual Control in Video Generation Models

May 08, 2025
Via arXiv

T2VPhysBench: A First-Principles Benchmark for Physical Consistency in Text-to-Video Generation

May 01, 2025
Via arXiv

Attention Mechanism, Max-Affine Partition, and Universal Approximation

Apr 28, 2025
Via arXiv

Discriminator-Free Direct Preference Optimization for Video Diffusion

Apr 11, 2025
Via arXiv

Do LLMs trust AI regulation? Emerging behaviour of game-theoretic LLM agents

Apr 11, 2025
Via arXiv

Provable Failure of Language Models in Learning Majority Boolean Logic via Gradient Descent

Apr 07, 2025
Via arXiv

Can You Count to Nine? A Human Evaluation Benchmark for Counting Limits in Modern Text-to-Video Models

Apr 05, 2025
Via arXiv

Exploring the Limits of KV Cache Compression in Visual Autoregressive Transformers

Mar 19, 2025
Via arXiv

Theoretical Foundation of Flow-Based Time Series Generation: Provable Approximation, Generalization, and Efficiency

Mar 18, 2025
Via arXiv

Limits of KV Cache Compression for Tensor Attention based Autoregressive Transformers

Mar 14, 2025
Via arXiv